Wilson Wu

mentions: 1 · type: Person

// recent coverage: 1 mention

15:46

2026-05-28

lesswrong.com

ai-safety

ARC's "Outperforming Random Sampling" explained

ARC researcher Eleni Angelou and her team have proposed a new formal goal for mechanistic interpretability that focuses on outperforming random sampling when predicting neural network behavior. The fr…

// co-occurs with top 2 entities

ARC (1), Eleni Angelou (1)